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Abstract 

We develop a framework especially suited to the autocorrelation properties 
observed in financial times series, by borrowing from the physical picture of 
turbulence. The success of our approach as applied to high frequency foreign 
exchange data is demonstrated by the overlap of the curves in Figure (1), 
since we are able to provide an analytical derivation of the relative sizes of 
the quantities depicted. These quantities include departures from Gaussian 
probability density functions and various two and three-point autocorrelation 
functions. 
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Financial time series and the unexpected scaling behaviors they display have re- 
cently attracted the attention [|T|-[3 of the theoretical physics community, as large 
data sets have become available. From a financial times series x{t) and the corre- 
sponding set of log price changes Aa; = ln{x{t + At)) — ln{x{t)) for time delay At, 
one constructs the probability density functions (PDFs) Pa* (Ax). There are three 
basic observations 1) PAt(Ax) often departs in a leptokurtic manner (sharper 
peak and close to exponential tails) from the Gaussian shape expected for a random 
walk. 2) Scale invariance is broken in the sense that the shape of Pa* (Ax) evolves and 
gradually becomes more similar to a Gaussian as At increases. More striking is that 
this approach to a Gaussian is slower than expected if the individual price changes 
were independently and identically distributed with finite variance. This anomalous 
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scaling indicates that there is some kind of correlation between the price changes 
at different times (autocorrelation). 3) Although there is negligible autocorrelation 
observed in the log price changes themselves, there is significant autocorrelation ob- 
served in the various functions of the log price changes. It is seen for example in the 
square of the log price changes, and thus is referred to as volatility clustering. 

We make the following definitions, where we assume that the PDFs have mean 
/i = and variance cr^ = 1 (otherwise replace Ax by (Ax — etc.). 

i^(/W,At) = -^^^M^_l (1) 

(/(Ax)) Gauss 
rift ^ ^ (/(Ax)/(Ay)) - (/(Ax))^ 

((/(Ax) - (/(Ax)))2) 
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F for various functions /(x) will provide different measures of the departures from a 
Gaussian PDF, as a function of the time delay At. G is the autocorrelation function 
for a lag time r where Ax is as above and Ay = ln(x(t — r + At)) — ln(x(t — r)). H is 
a three-point autocorrelation function where Az = ln(x(t — f + At)) — ln(x(t — |)), 
and it is constructed so that it receives no contributions from autocorrelations on 
scales smaller than r. Note that for the autocorrelations. At is typically constrained 
to be much less than r. F, G and H all vanish for a Gaussian random walk. 

We consider the data set made freely available in association with the Santa Fe 
Competition [|TU|. It consists of 329112 quotes (bid and ask) of the Swiss franc-US 
dollar exchange rate between May 20, 1985 to April 12, 1991. The time between 
price quotes is on average a few minutes, but it is very irregular. From this data 
we extracted a time series with a fixed time step of 10 minutes and a total of 84914 
prices. To do this we averaged the bid and ask quotes, used linear interpolation to 
determine a price at each time, and eliminated the prices changes between days. 



In Fig. (1) we plot as functions of At or r the quantities F{fa{x), At)/(a, G{fa{x), T)/^a 
and H {fa{x) , t) / Ka for a = 1,2, ...,6. The functions fa{x) and our calculated con- 
stants (Ca, ^a, /^a) (which are independent of the normalization of fa{x)) are specified 
as follows, where r 
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The striking result is that the quantities plotted in Fig. (1) are nearly independent of 
the function fa{x). This paper will provide an understanding of this result through 
the derivation of the {Ca,^a,i^a) constants. From the range of these constants we 
see that the functions fa{x) are probing the characteristics of the price changes in 
different ways. Our method also applies to any other functions of choice, and at 
the end we shall comment on functions which are more sensitive to very large price 
changes. 

We will motivate a one parameter family of PDFs with the following properties. 
1) They are derived in closed form. 2) They are leptokurtic with exponential tails. 3) 
They are closed under convolution. 4) Their derivation is linked to autocorrelation. 

The similarity between PDFs in finance and turbulence has recently been empha- 
sized in [0. For turbulence in a fluid, PDFs are obtained for the velocity differences 
at two points separated by some distance. The leptokurtic shapes of these PDFs is 



a reflection of the intermittency of turbulent flow |Tl|. The origin of this intermit- 
tency lies in the coming and going of coherent structures of various sizes, or eddies, 
in the velocity field. The essential point is that volatility in the velocity differences 
can be correlated over distance scales typical of the sizes of the eddies. It is the idea 
of 'volatility structures' of varying time scales, rather than size, which we will carry 
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over to financial time series, where price changes replace velocity differences. We will 
discuss further the analogy between turbulence and financial time series at the end. 

To develop a simplified picture we define a volatility structure (VS) of time scale 
T as follows. Take the total time of the series and split it up into intervals of length 
T. Let the volatility be constant on each interval, but allow it to vary from one time 
interval to the next. The distribution of these values defines a PDF for volatilities 
on scale T. Our goal is to account for the cumulative effects of VSs occurring on all 
time scales. 

First we consider the effect of a single VS of time scale T on the PDF Pa* (Ax) for 
T ^ At. We consider all pairs of prices at times differing by At; over each of these 
At intervals the volatility will be constant (ignoring those few intervals which happen 
to straddle a boundary), but the value of this volatility can be different for widely 
separated intervals. Thus the PDF will follow by summing over all random walks of 
duration At for all values of the constant volatility. This leads to a superposition of 
Gaussian PDFs, 

/•OO 

Vrj{a,Ax)= p^{S)PG>,nss{Sa,Ax)dS, (5) 



where Pcaussio', Ax) = J- e and is the variance. 

Pr]{S) with 5 > is the PDF for the volatility. To produce a family of PDFs with 
the properties listed above we make the following choice.^] 

MS) = ^f-'^-''^ (6) 

We note that this includes the simple Gaussian, Pi('S') = \J^e~~ . The volatility 

PDF has a mean which approaches unity from below as t] increases, while its variance 

-'^An alternative choice for PjAS), proposed in both the financial and turbulence literature 
and advocated more recently |§, ||, is the lognormal PDF. The drawback is that the resulting family 
of PDFs cannot be obtained in closed form, and they are not closed under convolution in the sense 
we describe. 
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Thus for large rj, p-q{S) becomes sharply peaked around S = 1 and Prj tends towards 

-Pcauss • 

The integral in (H) can be evaluated to yield Vrj{a,Ax) = Qi,r,(cr, r) where r = 
Qi,r,{cr,r) = Qi^^{l,r/a)/cr and 

Q..(l,r) = -^2'-i-^i^^r^-^K^.i^r). (8) 

Kjj{x) is the modified Bessel function of the second kind and i ^ 1 will be of interest 
below. These PDFs have a distinctive leptokurtic shape which becomes more promi- 
nent for decreasing rj. For any positive integer n, P„(l, Ax) is simply a polynomial 
in I Ax I of degree n — 1 times the exponential e"^^'^^'. The appearance of these 
exponential tails can be traced to the Gaussian factor in (^. The simplest member 
of this family of PDFs is purely exponential, 7^1(1, Ax) = -^e~^'^^'. 

We now consider the PDF PAt(Ax) which results from a single VS with T < At. 
For example with At = nT we must consider the convolution 

where the Xj are the log prices at regularly spaced intervals ti — ti_i = T . This 
convolution accounts for the variable volatility on each interval. We now make use of 
the fact that our family of PDFs is closed under convolution. In particular the results 
of [0 are sufficient to prove the following relation for half- integer rji and rj2. 

J dyVr^^{^(y, X - y)Vr,^{^a, y - z) = V^.+n^iVVi +^20", x - z) (10) 

Thus the PDF PAt(Ax) determined by @ is Vnrii^/na, Xn — xq). 



There are three points worth stressing for this case of a VS with T < At. 1) 
The variance of Pa* (Ax) increases exactly hnearly with At for fixed T. (This is 
trivially true for the previous case, T ^ At.) This linear growth of variance is a well 
documented feature of financial time series which can be related to the vanishing 
of G{f{x), t) for f{x) = X 0. 2) From the relation (0) between a^^i and rj we see that 
the effective variance in volatilities cr^oi on time scale At is decreasing as 1/At. In 
other words as the number n of intervals with independently fluctuating volatilities 
increases the effective PDF tends towards a Gaussian, in a manner as expected from 
the central limit theorem ||15|. 3) We now see that the whole family of PDFs in (^) 
are nothing but convolutions of the simple case when the volatility PDF Pr){S) is a 
Gaussian {rj = 1/2). 

We next consider the autocorrelation between price changes Ax and Ay, each 
occurring over a time step At, but separated by a lag r > At. For a VS with T ^ t 
we must sum over all pairs of random walks of duration At for all values of a common 
volatility. The joint PDF is then 

Jr,{a,Ax,Ay)= pr^{S)PGanss{Sa, Ax)Pg>,uss{S(t, Ay)dS. (11) 

J 

More generally an i-point joint PDF can be defined in the corresponding way, and 
we obtain 

Jr,{a, Axi, Ax2, Axi) = Qi,^(a, ^(Axi)^ + (Axs)^ + ... + (Axi)^), (12) 

where Qj^^(l,r) is defined in (H). 

From this joint PDF we may extract for example the PDF for the following com- 
bination of the m adjacent Ax's within time mAt: Vm = y^Z) adj acent ( ) ^ . [| Thus for 
very similar quantity was recently studied in 
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VSs with T > mAt we obtain the PDF 7^„,^(^;„) = 27r-t;^"'-^Q„,^^(a, t;„)/r(f ). 
This cannot be directly compared with data because it does not account for VSs with 
T < mAt. 

We return to the quantities F, G, and H in (|1|^), for which we can account for VSs 
occurring on all time scales T. For G and H only VSs with T larger than r contribute, 
and the cumulative effect of such VSs can be represented by an effective cT^Qi(r). From 
the inverse relation (|^) between a^^i and rj this corresponds to an effective rj which 
will specify J7',,(cr, Ax, Ay, ...), which in turn enters the determination of G and H. G 
and H vanish only in the large rj limit. 

For the PDF Pa* (Ax), VSs of time scales both larger and smaller than At con- 
tribute. We have seen how each VS with T < At makes a contribution to an effective 
(j^oi(At) suppressed by Tj At. These contributions can be added to the contribution 
of VSs larger than At to yield a total effective a1^^{At). The corresponding effective 
7] then specifies a PDF 'P,j(cr, Ax) which can be used to determine F. Unlike the case 
of a single VS with T < At, the effective a^^i{At) may now fall slower than 1/At. 
This corresponds to the slow approach to a Gaussian, which is the anomalous scaling 
observed in the data. Thus in our picture it is the presence of VSs on all time scales 
which underlies the link ^ between this anomalous scaling and the nonvanishing 
autocorrelations. 

To compare with data we first use the 1, 2, and 3-point PDFs, Vr, and JT",,, to 
determine F, G, and H as functions of rj. 

Fifaix),v) = ^-^, G(/,(x),r^) = ^, HiUx),v) = ^ (13) 

7] 7] 7] 

We find that Ca{v)y ^a{v) '^aiv) ^-^^ rather slowly varying functions which tend 
to constants for large rj. In principle we could extract F(/a(x), At), G{fa{x),T), and 
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H{fa{x),T) from the data and by comparing with ([T3|) , determine the appropriate rj 
for each At and r respectively. The prediction would be that the values of rj and the 
values of -F/C, G/^ and H/k, are independent of fa{x)- In fact for the range of r] we 
are considering we can, to good approximation, replace the functions Ca{v)y ^a{v) 
Kairf) by constants. The values hsted in correspond to (Ca(3/2), ^a(3/2), /ta(3/2)), 
which in turn yield the results in Fig. (1). In addition to the overlap of the curves 
we also note the close similarity of Figs. (IB) and (IC). This strongly supports our 
common description of all i-point autocorrelation functions in terms of the function 
Qi,^(a,r) in (|). 

In general our model provides a good description of the PDFs and the autocorrela- 
tions, with the exception being quantities sensitive to abnormally large price changes. 
For example when F/C^ with /(x) = x'^ is extracted from the data we obtain the top 
curve in Fig. (lA). {F{x'^,ri) is directly related to the kurtosis of our PDFs, and in 
this case we have simply ([t]) = 1.) This indicates that the extreme tails of the 
true PDFs fall off somewhat slower than expected, especially for shorter time delays. 
More striking are the autocorrelations involving large price changes; for example G/^ 
and H/k, with f{x) = gives the bottom curves in Figs. (IB) and (IC) respectively. 
Thus large price changes show much less autocorrelation than expected. This sug- 
gests that large price changes more strongly reflect the reaction of the markets to 
external events. 

We conclude by commenting on the analogy with turbulence in a fluid 0, ^, ^ 
^, 1^. The PDFs for turbulence evolve as a function of the distance r separating 
the two points at which the velocities are measured. We have presented a model 



for this evolution based on a formulation related to the present work [|1J]. A basic 
observation is that the evolution becomes more transparent when expressed in terms 



of r oc r^l"^ . (In fact r is a physical time scale in turbulence, the so-called turnover 
time [0.) In particular the variance of the PDF is proportional to r, while its shape 
evolves from the leptokurtic towards the Gaussian as r increases, making the analogy 
between financial time series and turbulence very close. The difference for turbulence 



in a fluid is that the data is quite consistent JI^ with a speed of approach to the 
Gaussian as described by the central limit theorem, a^^^ oc 1/r. Thus contrary to 
financial time series, volatility structures (eddies in this case) on large r scales are 
not sufficiently strong to cause a significant slowing down in the approach to the 
Gaussian. The turbulence in the financial markets is stronger in this sense than the 
turbulence observed in a fluid. 
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Figure (1): A, B, and C each display six curves with significant overlap. F{fa{x), At)/(f 
G{fa{x), At)/^a and H{fa{x), At)/Ka are defined in (|l]-^ with the six functions fa{x) 
and constants {(a, ^a, i^a) specified in (^). The additional dashed line in A corresponds 
to f{x) = x^, while the dashed line in each of B and C corresponds to f{x) = x"^. To 
produce B and C we have removed from the data the well known correlation between 
volatility and time of day. This changes the shape of the curves but not their overlap. 
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